Evaluation of document binarization using eigen value decomposition

نویسندگان

  • Deepak Kumar
  • M. N. Anil Prasad
  • A. G. Ramakrishnan
چکیده

A necessary step for the recognition of scanned documents is binarization, which is essentially the segmentation of the document. In order to binarize a scanned document, we can find several algorithms in the literature. What is the best binarization result for a given document image? To answer this question, a user needs to check different binarization algorithms for suitability, since different algorithms may work better for different type of documents. Manually choosing the best from a set of binarized documents is time consuming. To automate the selection of the best segmented document, either we need to use ground-truth of the document or propose an evaluation metric. If ground-truth is available, then precision and recall can be used to choose the best binarized document. What is the case, when ground-truth is not available? Can we come up with a metric which evaluates these binarized documents? Hence, we propose a metric to evaluate binarized document images using eigen value decomposition. We have evaluated this measure on DIBCO and H-DIBCO datasets. The proposed method chooses the best binarized document that is close to the ground-truth of the document.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Evaluation of Document Binarization Results

Most of the document binarization techniques have many parameters that can initially be specified. Usually, subjective document binarization evaluation, employs human observes for the estimation of the best parameter values of the techniques. Thus, the selection of the best values for these parameters is crucial for the final binarization result. However, there is not any set of parameters that...

متن کامل

A Novel Degraded Document Image Binarazation by using Local Thresholding Segmentation

The proposed binarization is a scheme of parting a image pixel values into two classes black as foreground and white pixels as background then the thresholding is found for well known scheme for document image binarization. In this proposed work for the decomposition of both global and local thresholding this basic thresholding value we can use further. Here the global thresholding scheme is ef...

متن کامل

Abstract-Papers up to 4 pages should be submitted using this format

Abstract This paper presents a robust watermarking for still digital images based on schur factorization and Singular Value Decomposition (SVD). In this paper, image is decomposed into 8 8  blocks and after applying schur factorization, the stable largest eigen values of the upper triangle is used as robust locations for embedding watermark. The matrix array formed by largest eigen values furt...

متن کامل

Evaluation of a New Eigen Decomposition Algorithm for Symmetric Tridiagonal Matrices

This paper focuses on a new extension version of double Divide and Conquer (dDC) algorithm to eigen decomposition. Recently, dDC was proposed for singular value decomposition (SVD) of rectangular matrix. The dDC for SVD consists of two parts. One is Divide and Conquer (D&C) for singular value and the other is twisted factorization for singular vector. The memory usage of dDC is smaller than tha...

متن کامل

Estimation of Proper Parameter Values for Document Binarization

Most of the existing document-binarization techniques deal with many parameters that require a priori setting of their values. Due to the unknown of the ground-truth images, the evaluation of document binarization techniques is subjective and employs human observers for the estimation of the appropriate parameter values. The selection of the appropriate values for these parameters is crucial an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013